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Abstract. This paper studies the extreme gaps between eigenvalues 
of random matrices. We give the joint limiting law of the smallest gaps 
for Haar-distributed unitary matrices and matrices from the Gaussian 
Unitary Ensemble. In particular, the fcth smallest gap, normalized by a 
factor n 4 / 3 , has a limiting density proportional to x 3k ~ 1 e~ x . Concern- 
ing the largest gaps, normalized by n/^/logn, they converge in L p to a 
constant for all p > 0. These results are compared with the extreme 
gaps between zeros of the Riemann zeta function. 



1. Introduction 

We address here the following question: what is the asymptotic size and 
the limit laws of the smallest and largest gaps or spacings in the spectra of 
random matrices? Typical spacings between eigenvalues of random matrices 
have been well understood for invariant ensembles for quite some time. More 
recently, the behavior of these typical spacings has even been proved to 
be universal for much larger classes of random matrices ([Mj, [15], [2]). 
Much less is known for atypically large or small spacings. This question 
was first considered for the smallest spacings in the unpublished PhD thesis 
of J.Vinson (|35j) and raised by Persi Diaconis in [II?] for the largest ones. 
It was also discussed in an interesting debate during a conference at the 
Courant Institute in 2006, in honor of Percy Deift. We solve here completely 
the question of the smallest spacings for the simplest invariant ensembles, 
i.e the CUE and the GUE. We give the scaling and the limit laws for the 
joint distribution of the smallest spacings. The answer is simple to state: it 
is given by the trivial Poissonian ansatz where the spacings would be treated 
as i.i.d random variables. The answer we find for the largest spacings is less 
complete, since we can only obtain at this point a first-order approximation 
which gives the asymptotic size of the largest spacings and not their limit 
laws. We believe that the same Poissonian ansatz should work as well for the 
largest gaps, but this question is left open. The question of the universality 
of the behavior of the extremal spacings is also left open. 

This work was initiated while P.B. was visiting the Courant Institute, and continued 
in Telecom ParisTech: he wishes to thank both institutions for their kind hospitality and 
support. The work of the first author was supported by the NSF grants DMS-0806180 
and OISE-0730136. Both authors want to thank Percy Deift for his help and very useful 
discussions. 
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Dyson |13| showed that the repulsion between eigenvalues of the Gaussian 
Unitary Ensemble (the GUE) could be described asymptotically in terms of 
the determinantal point process associated with the sine kernel 

1 sin(x - y) 

K(x,y) = . 

7T x — y 

For this limiting determinantal point process, the probability of having no 
eigenvalue in an interval of size s is known to be the Fredholm determinant 
det(Id — iT( 0jS )), where K(o, s ) is the convolution operator acting on L 2 (0, s) 
with kernel K. The density of the spacing between two successive points is 
then given by (see |26j ) 

P2(s) = d ss det(Id - if(o, a ))- 

This spacing distribution was shown to appear in a number-theoretic con- 
text: some statistics produced by Odlyzko [28] presented a correspondence 
between the histogram of normalized gaps between zeros of the Riemann 
zeta function and P2(s). This gave further evidence for the analogy discov- 
ered by Dyson and Montgomery: they realized that the local dependence of 
the zeros of £, previously calculated by Montgomery, involved the sine kernel 
(see [22], [23J for an historical account and other steps of this fruitful anal- 
ogy). At the same mean or typical gap scale, a precise analysis of the joint 
distribution of the gaps between eigenvalues was performed by Katz-Sarnak 
[22] and Soshnikov |30] . for the Circular Unitary Ensemble (the CUE). 

Less attention was paid to eigenvalues statistics at smaller and larger 
scales. This paper concerns the extreme gaps. This study was initiated by 
J. Vinson [35]: he showed that the smallest gap between elements of the 
CUE, multiplied by n 4//3 , converges in law to a random variable with distri- 

3 

bution function e~ x , as the size n of the unitary matrix increases. In his 
thesis, similar results for the smallest gap between eigenvalues of a general- 
ization of the GUE were obtained. Vinson also gives interesting heuristics 
suggesting that the largest gap between CUE eigenvalues should be of order 
\/log n/n, with Poissonian fluctuations around this limit. Using a different 
technique, Soshnikov [32J investigated the smallest gaps for determinantal 
point processes on the real line, with a translation invariant kernel: amongst 
points included in [0, L], this extreme spacing multiplied by L 1 / 3 converges 

3 

weakly to the distribution with distribution function e~ x , as L — > oo. 

Heuristically, the above extreme gaps asymptotics can be obtained using 
the known asymptotics [131 ES] of the spacing distribution 

7T 2 S 2 
P2(s) ~ —S 2 , logp 2 {s) ~ — — , 
s— >0 6 s— >oo 8 

and treating the gaps as independent random variables. The difficulty to 
obtain rigorous results lies in showing that this Poissonian ansatz is asymp- 
totically correct for the extreme gaps, and in making the above estimates 
uniform in the dimension n. 
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We first consider the joint law of the smallest gaps (Theorem |1.1| Corol- 
lary 1.2) between eigenvalues of unitary matrices. This relies both on Sosh- 
nikov's method and a convergence of the process of small gaps to a Poisson 
point process. The same reasoning equally applies to the small gaps between 



eigenvalues from the Gaussian unitary ensemble (Theorem 1.4, Corollary 



1.5). The proofs of the small spacings asymptotics are in Section 2. 

The first order asymptotics of the largest gaps is then proved. Concerning 
unitary random matrices (Theorem 1.3), this makes use of two important 
tools. A key ingredient, by Deift & al [9], is the uniform asymptotics about 
the probability for a given arc of the circle to be free of eigenvalues. The 
proof also requires the negative correlation property for the event that two 
disjoint arcs are free of eigenangles. On account of the GUE (Theorem 1.7), 
we also make a essential use of the negative correlation, and the large gap 
probability is evaluated by comparing the GUE Fredholm determinant with 
the unitary one. These large gaps asymptotics are proved in Section 3. 

The extreme spacings between random eigenvalues are important quanti- 
ties for statistical physics, computational mathematics and number theory. 
For this reason, Diaconis [12] mentions the open question of maximal spac- 



ings, answered in Theorem 1.3 After making our results explicit, succes- 
sively for unitary matrices and the GUE ensemble, we give applications of 
our extremal spacings statistics at the end of this introduction. 
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1.1. The unitary group. Let u n , a Haar-distributed (measure /iu(n)) UIU ~ 
tary matrix over C n . Suppose u n has eigenvalues e l6,fc, s, with ordered eige- 
nangles < 9\ < • ■ • < 6 n < 2ir. Consider the point process on M? 

n 
8=1 

Our first result is about the convergence of x^ to a Poisson point process, 
thanks to this normalization by n~ 4 / 3 . 

Theorem 1.1. Suppose u n ~ /^ufn)- As n — > oo, the process converges 
to a Poisson point process x with intensity 

for any bounded Borel sets A C M + and I C (0, 2ir). 

The intensity is proportional to Jj du because of the rotational invariance 
of the Haar measure. The corresponding factor will be less trivial in the 
case of the GUE ensemble. Our method to prove Theorem 1.1 relies on the 
s-modified random point field technique initiated by Soshnikov [301 132] : one 
can calculate the correlation functions of the process obtained by keeping 
only the O^s for which 8^ + An~ A /^ contains exactly one other eigenvalue. 
Contrary to [30l|32], we do not use the notion of cluster functions, because 
we characterize the convergence to Poisson random variables thanks to the 
convergence of the factorial moments; this allows also to consider easily non 
translation invariant kernels, like in the GUE case. Moreover, Theorem |1.1| 
gives information about the joint distribution of the number of gaps taking 
values in disjoint intervals (convergence in terms of point processes). In 
particular we can compute the limiting joint law of the smallest gaps. 

Let ti < • • • < be the k smallest eigenangles gaps (i.e. of the form 
8i + i — 9i, where the indexes are modulo n and gaps are denned as small as 
possible, i.e. 6i + \ — 6i S [0, it]). For the sake of brevity, write 

r H = (727r)- 1 /3^). 
The limiting joint law of the r^'s is a corollary of Theorem 1.1 



Corollary 1.2. For any < x\ < y\ < ■ ■ • < < y^, under the Haar 
measure on U(n), 

x £ < n 4 / 3 rj n) < w , 1 < £ < k) (V** - e"«2) f[ (yf - xf) . 

e=i 

(1-1) 

In particular, the k th smallest normalized space n^^T^ 1 converges in law to 
Tk, with density 

F(r k e dx) = — ^— x^e-^dx. 

[k-iy. 
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Note that this result, for k = 1, is proved in Vinson's thesis [35] by a 
different method: he characterizes the number of small gaps as a symmetric 
function of the eigenvalues, and computes its moments. It is not clear how 
his method can be extended to provide the joint law of the k smallest gaps. 

We now turn to our next question about extreme gaps, i.e the asymptotic 
behavior of the largest gaps, which were guessed by Vinson, based on the 
supposed asymptotic independence of distant gaps. We obtain thanks to the 
precise asymptotics of one gap probability, obtained by the steepest descent 
method for Riemann Hilbert problems in [9], and the negative association 
property of determinantal point processes (see e.g. [3]). Note that both 
results are posterior to Vinson's thesis. 

Consider 7^ > > • • • the largest gaps between successive eigenan- 
gles of u ~ H\}(n)i i- e - °f the form \Qi+\ — 9{\, where the indexes are modulo 
n and — 9i\ £ (— 7r, it). Then, as n goes to infinity, the largest gap 
converges in L p to a constant, for any p > 0: 

n r} n) ^4 1. 



-y/321ogn 

Actually the above limit holds for all the £ n largest gaps if i n is subpoly- 
nomial. 

Theorem 1.3. Let £ n = n ^ be positive integers. Then for any p > 0, 



" rt ] ^ i 



V / 32~Togn 



as n 



ocB 



Note that, for independent uniform eigenangles on the unit circle, the 
largest gap is of order (log n) / n, more than in the above theorem, as expected 
from the repulsion of the eigenvalues in the determinantal case. 

1.2. The Gaussian unitary ensemble. Similar results hold for the GUE. 
For this ensemble, the distribution of the eigenvalues has density 

J-e-*£? =1 Af/ 2 11 | Ai -A/ (1.2) 

i<i<j<n 

with respect to the Lebesgue product measure, on the simplex Ai < • • • < A n . 
The empirical spectral distribution ^ ^ 5\ . converges in probability to the 
semicircle law (see e.g. [1]) 



Psc{x) = ^7(4" 



X 2 ) 



2 A detailed analysis of the proof gives a speed of convergence: for example one can 
show that for any sequence a n — 0(1), 

The problem of the exact fluctuations will be addressed in a future work. 
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Like for the unitary group, we first consider the smallest gaps, studying the 
point process 

n-1 

£(») = 5^a (n 4/3 (A . +1 _ A . )iA . ) i|A i |<2- eo > 

1=1 

for any arbitrarily small fixed £o > (this is a technical restriction allowing 
the use of the Plancherel-Rotach asymptotics of the Hermite polynomials). 

Theorem 1.4. As n — > oo, the process x^ converges to a Poisson point x 
process with intensity 

1 f .2 A .\ ( [ tA ™2n2. 



Ex(AxI)= J A udu ) [J^-xTdx 

for any bounded Borel sets A C M+ and I C (—2 + eq, 2 — eq). 

The following corollary about the smallest gaps is an easy consequence of 
the previous theorem. As for the unitary group, introduce < • • • < 
the k nearest spacings in I, i.e. of the form Aj+i — Aj, 1 < i < n — 1, with 
Xi e I, I = [a, b], -2 < a < b < 2. Let 

1/3 



fi n) = ( jf (4 - z 2 ) 2 d*/(144vr 2 )) 



Corollary 1.5. For any < x\ < y\ < • ■ ■ < Xk < Vk, with the above 
notations and for the GUE ensemble measure (1.2), 



k-i 



x e < n 4 / 3 f\ n) < yt , 1 < I < k) -+ ( e -*S - e^l) J] (yf - x 3 ) . 

n 00 e=i 

In particular, the k th smallest normalized space n^ 3 f^ converges in law to 
Tf., with density 

{k - 1)! 

The above result gives for k = 1 the probability density of the normalized 
smallest gap in a given interval J: 3x 2 e~ x . The following precises the 
distribution of this smallest spacing location in the bulk of the spectrum. 

Corollary 1.6. Let inf be the index of the smallest gap between eigenvalues 
of the GUE in a compact subset I C (—2, 2) with nonempty interior: 

\(n) ,(n) • T ,rr\( n ) \(") I \W c n 

A inf+1 _ A inf - mt lA+l - A i I A i t ^i- 

! 

proportional to 



As n —7- oo, A inf converges weakly to the probability measure with density 



(i-x 2 )H x£l 
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We now turn to the largest gaps for the GUE ensemble. The result is 
completely different inside the bulk and on the edge. Indeed, for eigenvalues 
strictly inside the support of the limiting measure, the maximal spacings 



have order -^/log n/n (see the following Theorem 1.7), while the eigenvalues 



on the border have an average distance of higher order, n 2 / 3 : for any k, 

n 2 / 3 (A n - 2, . . . , \ n _ k - 2) 

converges weakly as n — > oo to a multivariate Tracy- Widom distribution 
(see e.g. [1]). Strictly inside the bulk, the result is analogous to the circular 
case, the only difference being the normalization, due to the average density 
of eigenvalues. Let 7^ > 7^ > . . . be the largest gaps of type Aj+i — Aj 
with Aj 6 /, a compact subset of (—2,2) with nonempty interior. 

Theorem 1.7. Let l n = be positive integers. Then for any p > 

xel v J ^/32 logn £ ™ 

1.3. The C zeros. When seen in a window of size proportional to the av- 
erage gap, the spacings between the zeros of Dirichlet L- functions are dis- 
tributed like particles of a determinantal point process with sine kernel, this 
is the Montgomery-Odlyzko law. Here we want to discuss the accuracy of 
this analogy when looking at rare events, the extreme gaps between the zeta 



zeros, relying on Theorems 1.1 and 1.3 



Due to the availability of many numerical data, we focus on the Riemann 
zeta function ((s) = Yl^=i V nS (£He(s) > 1)> which admits an analytic 
continuation to C — {1}. Let 1/2 ± itk be the non trivial zeta zeros, 7, = 
JHe(ii), with < 71 < 72 < . . . Then 

7i+i - 7i , / li 

has an average value 1. The quantity 

A = lim sup 71 

i— >co 

has been widely studied. Conditionally to the generalized Riemann hypoth- 
esis, the best known result is A > 3.0155 [5J. From the GUE hypothesis 
for the zeta zeros, it is expected that A = 00. However, to the best of 
our knowledge, more precise conjectures about the growth speed of large 



gaps between zeta zeros were not proposed. From Theorem 1.3, amongst 



n successive gaps with fermionic repulsion, the maximal gap has size about 
V / 321ogn/(27r) times the average gap, suggesting 

-y/32 logn 



sup 7j ~ 

m<k<m+n n ^°° 27T 



in particular 



lim sup (7*4-1 - 7i) = 1- 
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Odlyzko's numerical data [28] give 3.303 for the maximal value of ji, 1 < i < 
n = 10 6 , while ^32 log n _ g ^^g^ giving a difference of 1% with the observed 
gaps. 

Further tests can be performed at 
distinct heights along the critical S[ 

axis, thanks to numerical data of . . . ....»»«» 

Gourdon [IB] : he computed n = 2 x 1 ' 
10 9 successive zeta zeros at height 3 ; 
10 fc along the critical axis for each 
k G [13,24]]. The extreme nor- 2 : 
malized gaps are given on the joint | 
graph, where the expectation from 

our random matrices result is the r . , 

/oTpj 12 11 16 18 20 22 24 

straight line V32 „ logn = 4.166. For 
example, amongst the 2 x 10 9 gaps 

following the height 10 24 , sup7i = 4.158, i.e. a difference of 0.2% with the 
expected value. 

Concerning the smallest gaps, does the Poisson intensity 23^i* 2 dii from 
Theorem 



1.1 



appear in the context of the zeta zeros ? Note 9{ = n(6i+i — 
#,)/(27r) the normalized gaps. We know that, as n — > oo, the set of gaps 
{27rn 1 / 3 ^,l < i < n] converges weakly to a Poisson point process with 
intensity ^u 2 du. We therefore expect that, as n — > oo, 



27rn 1/3 {7i, 1 < i < n} 



converges to a Poisson point process 
The joint graph gives the histogram 
of the 3000 smallest gaps, normal- 
ized as previously, amongst the n = 
10 13 first zeta zeros, based again on 
numerical data from [18] . More pre- 
cisely, the histogram gives the 3000 
smallest values of 27rl0 13 / 3 {7j, 1 < 
i < 10 13 }. The straight line is the 



with the same intensity. 



function 5 x 



2 (the step of the 
This presents a 




histogram is 5). 
good relevance of the GUE hypoth- 
esis for the Riemann zeta function, even at the scale of rare events, here the 
extreme spacings. 



1.4. Diagonalization speed with the Toda flow. The most classical 
method to diagonalize a matrix is the well-known QR algorithm. In the 
case of Hermitian matrices, an alternative approach was proposed by Deift 
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& al [lOjfJ based on the isospectral property of the Toda flow. More precisely, 
given a nxn Hermitian matrix M , the first step is to reduce it in a tridiagonal 
form T (which is a robust and fast operation), conjugating with successive 
Householder reflections, 



T 



h 

a-2 



M-i 



K-i 

a n 



keeping the same spectrum as M. After such a tridiagonalization, The a, 
and biS are real. The above matrix is important in the analysis of the Toda 
lattice: Flaschka, Henon, and Manakov proved independently in the 70's 
that the following evolution of n particles on a line [xq = — oo, x n+ \ = +00, 
1 < k < n), 

itk = e Xk ~ 1 ~ Xk — e Xk ~ Xk + 1 
is an integrable system. More precisely, after the change of variables 

a% 

bi - 2 

the differential equation takes the Lax pair form 

dT 

where 

b x 



-Xi/2 

l e (xi-x i+1 )/2 



ST-TS 



S 



n 







b n -l 





In particular and importantly, the spectrum of T(t) does not depend on time. 
Moser proved that Xk(t) = \k + 0(1), Xk(t) = Xkt + + 0(1) as t — > 00, with 
Ai < • • • < A n . This implies that bk(t) converges to 0, hence T converges to 
a diagonal matrix, whose entries give the eigenvalues of M = T(0). Deift [7] 
asked about the speed of convergence of the Toda flow till its equilibrium. 
More precisely, for a given e > 0, what is the necessary time t such that 
b k (t) < e for all 1 < k < n - 1? As 

£ fc m _ e |(Afc-Afc + i)t+i(jtifc-/i fc+ i)+0(l) j 

the speed of convergence to the spectrum is governed by the minimal gap 
between eigenvalues. A good choice for a typical Hermitian matrix is a ma- 
trix from the GUE, with independent (up to symmetry) complex Gaussian 



Their approach is enounced for symmetric matrices, and naturally extends to the 
Hermitian case. 
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entries (of variance 2 on the diagonal, 1 elsewhere). Prom Corollary 1.5, the 
minimal gap for such matrices is of order 



nn" 4 / 3 =n- 5 / 6 . 



For a given required precision e, the Toda flow time necessaryto evaluate 
the eigenvalues is expected to grow as n 5 / 6 with the dimension^} 

2. Small gaps 

2.1. Convergence to Poisson point processes. We first review some 
results about convergence of processes, here with values in M 2 and a finite 
number of atoms. A point process x^ = Yli=i n ^ s sa ^ to converge in 
distribution to \ = J2i is for any bounded continuous function / 



X W (/) = £/(*i,rO^ 

i=l i 

To show the convergence of x^ to a Poisson point process %, we only 
need to show the convergence in law of x (A, I) to x(A, -0 f° r au bounded 
intervals A and /, the independence for disjoint A x Fs being an automatic 
consequence: this is a very practical property of Poisson point processes, 
detailed in Proposition 2.1 below. Moreover, the convergence of x (A,I) 
will be shown thanks to the convergence of the factorial moments to those 
of a Poisson random variable. This is particularly adapted to our situation 
because as we will see the correlation functions of point processes are defined 
through factorial moments, and are explicit in the case of determinantal 
point processes. 

Note that this is the same technique employed in [3], where the following 
result is given, in the case of point processes with values in R. 

Proposition 2.1. Let x^ = S»=i <^Xi „ be a sequence of point processes 
on M 2 , and x a Poisson point process on M 2 with intensity fi having no 
atoms (and a -finite). Assume that for any bounded intervals A and I and 
all positive integers k > 1 

n^oo y( x ( n )(Ax I)-k)\J ^ ' v ' 

Then the sequence of point processes x^ converges in distribution x- 



Proof. We need to check the three conditions of the following Theorem 2.2 
by Kallenberg, which is written here in the specific case & = R 2 , U and J 
the set of compact rectangles A x I [A and / intervals), with the notations 
from [21:. Conditions (1), (2) and (3) will be verified if 

X {n) (AxI)^ x (AxI). (2.2) 



4 Note that more precise estimates would need the asymptotics of fik—fJ-k+i as a function 



of n 
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Indeed, if (2.2) holds, for any t 

(x {n) (AxI)>t)=F( X (AxI)>t), 



lim 1 

n— >oo 



and as x{A x I) is almost surely finite this goes to as t — > oo (is is a 
consequence of the dominated convergence theorem); this proves part (1). 
Equations (2) and (3) are also consequences of the above convergence in law. 

To prove ( |2.2[ ), as x{A X J) is a Poisson random variable, the convergence 
of all the moments is sufficient. A moment is a finite linear combination of 
factorial moments, which concludes the proof. □ 

Theorem 2.2 (Kallenberg, [2T]). Let x be a point process onM 2 and assume 
X is almost surely simple (i.e. the atoms of the measure x a ^ have weight 
1 almost surely). Then X ^ n ' converges weakly to x if an d only if the three 
following conditions are satisfied for any compact intervals A and I inM: 

(1) lim^oo hnwoo P {x in \A x I) > t) = 0; 

(2) lmWooP ( X (n) (A x I) = 0) = F( X (A x I) = 0); 

(3) hmsup n ^ 00 P( X ^(^ x /) > 1) < F( X (A x /) > 1). 

2.2. Correlation functions. References for the main properties of correla- 
tion functions of determinantal point processes are [31] and |20j . We follow 
this last survey to present the notions used in the following. If X = Si 
is a simple point process on a complete separate metric space A, consider 
the point process 

X il all distinct 

on A k . One can define this way a measure on A k by 

M {k \A) =E(~ (k \Ai 



for any Borel set A in A k . Most of the time, there is a natural measure A 
on A, in our cases A = R or (0, 2ir) and A is the Lebesgue measure. If 
is absolutely continuous with respect to A fc , there exists a function on A fe 
such that for any Borel sets B\ , . . . , Bf. in A 



M (k) (Bi x ••• x B k ) = [ 

Jb 1 



p k {xi, . . . ,x k )dX(xi) . . . dA(x fc ). 

x--xB k 



Hence one can think about pk(xi, ■ ■ ■ ,x k ) as the asymptotic (normalized) 
probability of having exactly one particle in neighborhoods of the x^'s. More 
precisely under suitable smoothness assumptions, and for distinct points 
x%, . . . , Xk in A = M, 

p k (xi, ...,x k ) = lim —r — — P (x(xi, Xi + e) = l,l<i<k). 
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Note that pt is not a probability density. Moreover specifically, if B\ , . . . , Bi 
are disjoint in A and m + • • • + ni = k, 



M^{B^x---xB^)=E TT n 

\r=i (x(jB 



(x(BO)i 



no- 



see |20j for a proof. In particular, 

M (*)( 5 *) = E ( ^^i ) = ^ • ■ • , ^)dA(xx) . . . dA(a*). 

(2.4) 

Note the analogy with formula (2.1 ) we want to prove. For unitary matrices 
or the GUE ensemble, our method to prove convergence of small spacings 
counting measures is the same: 

• For given compact intervals A and / consider the modified process 
obtained from £( n ) = Yl &Xi „ by keeping only the points Aj jn in I 
such that x (n) ( x i,n + An" 4 / 3 ) = 1. 

• Show that the correlation function p^\xi, . . . ,x n ) of this new pro- 
cess uniformly converges to p(A x I) k . This is possible thanks to the 
determinantal aspect of £( n ) and the Hadamard-Fischer inequality, 
Lemma 12.31 

• Conclude that the factorial moments converge to those of the ex- 



pected Poisson random variables thanks to (2.1) and (2.4) 

For the smallest gaps asymptotics, the following inequality will be repeat- 
edly used. A concise proof can be found in |19j . 

Lemma 2.3. Let M be a positive- definite n x n (Hermitian) matrix. For 
any uj C [l,n]], let M u (resp. M^) be the submatrix of M using rows and 
columns numbered in uj (resp l_l,n}/u)). Then 

det(M) < det(M w )det(Msr). 

2.3. The unitary group. We begin with the proof of Theorem |1.1[ We 
know that for a unitary matrix u n ~ Hxs( n ) > the density of the eigenangles 
< 6 1 < • • • < 6 n < 2ir, with respect to the Lebesgue measure on the 
corresponding simplex is 

1 TT uiflj _ e ie fc |2 

Moreover, a remarkable fact about the point process ^ 5 e w k is that it is de- 
terminantal: all its correlation functions pY , 1 < k < n are determinants 
based on the same kernel: 

This classical property relies on Gaudin's lemma, see [26J. In the following 
for any bounded interval A C R + we write A n = n~ 4 ^A. We want to show 
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that for an interval I C (0, 2tt) 

X {n) (AxI) ^>Po(A) 
with A = J A u 2 duj (Jjdu). We consider the point process 

n 
i=l 

and its thinning f( n ) obtained from £( n ) by only keeping the eigenangles Ok 
for which ^ n '(9k + A n ) = 1. The following lemma means that x {n) (A x I) 
is properly estimated by It is analogous to Lemma 3 in 



Lemma 2.4 (No successive small neighbors). For any interval I C (0,2tt), 
as n — y oo, x (n) {A x I) - f( n )(J) ^> 0. 

Proof. Let c be such that A C (0,c), and c n = cn~ 4 / 3 . If lg _g. eAn ^ 
t^n )(e . +An)=1 , then ^ n \9i + (0,c n )) > 2. Hence 

n 
i=l 



where the last inequality comes from the definition (2.3), where A is the set 
of points (8,xi,X2) with 9 G (0,2it) and (xi,X2) G (9,9 + c n ) 2 . To prove 
that this positive random variable converges in law to 0, we consider its 
expectation 

i-2tx 



dO 



U(n)/ n \ 
P$ '{0,Xi,X 2 ) 



p^ n \6, x±, x 2 )dxidx 2 = 2n / p^ n \o, xi, x 2 )dxidx2 

'(0,c„) 2 J(0,c n ) 2 

and show it goes to 0. Thanks to the multilinearity of the determinant, 

K u (")(0) K v ^\ Xl ) - K u (™)(0) K u ( n )(a; 2 ) - K u (™)(0) 

^ u (")(a;i) i^T u ( n )(0) - K vi - n \xi) K v{ - n \ Xl - x 2 ) - K v{n \ Xl ) 

K v ^(x 2 ) K^ n \ Xl - x 2 ) - K v{n \x 2 ) K u (")(0) - K v ^ n \x 2 ) 

As |^ u(?l) |oo = O(n) and |# /U W|oo = 0(n 2 ), the first column of this deter- 
minant is 0(n) and the two others are 0(n 2 c n ). Thus 

P J (n) (0,x 1 ,x 2 ) = O(n 7 / 3 ). 

The integration domain is c 2 = 0(re -8 / 3 ), concluding the proof. □ 

Let pi^ n \0i, . . . ,0k), k > 0, be the correlation functions of the point 
process If, for any k > 1, the convergence of the factorial moment 



\ ({<»)(/) A* 



, u 2 du / — , (2.5) 
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can be shown, then Theorem 1.1 will be proved, thanks to the above Lemma 



2.4 The way to show (2.5 ) relies on three steps, to apply a simple dominated 



convergence argument: 

• if all 6 k s are distinct p^ n \0\, ■ ■ ■ ,9 k ) converges to (jg^ f A u 2 du) k 
as n — > oo (Lemma 2.5); 

• in the set 

fiU(n) = ^ 0k) e jk . Q . g q. + A ^ 1 < z , j < k}, (2.6) 

pl {Si-, ■ ■ ■ , 9 k ) is uniformly bounded (Lemma 2.6); 

• even if flj (n) (0i,... , is not uniformly bounded in the complement 
of fi u ( n ) in J fc (fi ), the contribution to the integral is negligib 



because the volume of fi U ^ decreases sufficiently fast (Lemma 2.7). 



Lemma 2.5 (Simple convergence). Let 9\, . . . , 9 k be distinct elements in I k 
Then 

pf%,...A) -> / u 2 du 

n-^oo Y487T Z 

Proof. First note that, as all the #fc's are distinct, for sufficiently large n 
the point (0 l3 . . . , 9 k ) is in fi u(n) (see (Q). This means that if l3 . . . , fe 
are points of the point in each of the 0j + A„ is not another one of 
the 0,'s. This makes the combinatorics easy: the correlation functions of 
can be explicitly given in terms of those of as noted in [31], by an 
inclusion-exclusion argument: for sufficiently large n 

^(^•••A)=£^P/ d Xl ...f d, k 

^ ml J9i+A n Je k +A n 

/ /°2fe+m( l' x l' • • -^k,x k ,yi, • • ■,y m )dy 1 . ..dy m . 

J ((0i+A„)U-U(0 fe +A„))™ 

(2.7) 

Note that there is no convergence issue here as p^ k ^ m = if 2k + m > n. 
We first show that the term corresponding to m = in the above sum gives 
the expected asymptotics. The determinantal aspect of the process makes 
things easy: Pofc (01, x\, ■ ■ ■ , 9 k , x k ) is a 2k x 2 A; determinant, and only the 
terms in the 2x2 diagonal blocks make a significant contribution (which 
leads to the idea of asymptotic independence). More precisely, we write 
formally 

As |a?i - 0i| = 0(n" 4 / 3 ) and K v( - n \x) = sin(ns/2)/ sin(x/2), if i 7^ j all 
terms of the corresponding 2x2 above matrix are 0(1). Moreover, the above 
determinant is unchanged by subtracting an odd column to the following 
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even column, and then by subtracting an odd line to the following even line. 
In this way, the diagonal 2x2 matrices becomes 

K u (") (0) K v ^ (9i -Xi)- (0) 

K v ^ (xt -0i)- -FsT u(n) (0) 2K u ( n ) (0) - if u (™) ( Xi - 9$ - K 15 ^ (0* - x 

0{n) 0(n 2 / 3 ) 
0(n 2 / 3 ) 0{n 1 ' 3 ) 

where the last equality relies on |if u ^|oo = 0(n), \K' v ( n >\oo = 0(n 2 ) and 
|ir" u ( n )|oo = (9(n 3 ). As a consequence, in the expansion of the determinant 
over all permutations of S2 k > the terms corresponding to entries only in the 

4 i 

diagonal 2x2 block matrices have order at most ns , while all other terms 
have a strictly lower order (at most n3 fc ~5). As the integration domain of 
P^k ls 0(n~3 k ), the only permutations hopefully giving a non-zero limit 
need to come from the block diagonal 2x2 matrices. Indeed they give a 
non-trivial limit: their contribution is exactly 

A simple change of variable x = n 4 / 3 u allows to conclude, thanks to the 
easy limit, uniform on compacts, 



1 ^ 2/3 / f sin(n- 1 /3 u/2 ) \ 2 \ | 



n~'~ i - — — — • — > t^tm 



2 



(27r) 2 I I nsin(n" 4 / 3 u/2) J J n->oo 48ir 2 
Our last task is to show that in the limit (|2.7l) is equivalent to its m = 



term. By iterations of the Hadamard-Fisher inequality, Lemma 2.3 



P2k+m(. l> x l> ■ ■ -,6k,x k ,yi, . . . ,y m ) < p^Vi.^ii • • • « d k, x k ) Pi^iVi)- 

i=l 



The contribution of all terms with m > 1 in (2.7) is therefore bounded by 

/ dxi... I dx k p2 k {n) (9 1 ,x 1 ,...,9 k ,x k ))y2^ L i ( I Pi^iy)) 
Je 1 +A H Je k +A n J m y 1 ml \Ja n J 

where the integration domain a n = {9\ + A n ) U • • • U (9 k + A n ) has size 
0(n~ 4 / 3 ) and p^ \y) = n. The first term of the above product converges, 
as previously proved (it corresponds to m = 0), so the whole term goes to 
as n — y oo, concluding the proof. □ 

Lemma 2.6 (Uniform boundness). There is a constant c depe nding only on 
A such that, for any n>l and (6%, ... ,9 k ) G f] u ( n ) (see (2.6)), 

p V k in \9 1 ,...,9 k )<c. 
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Proof. As previously mentioned, the formula (2.7) is true whenever, for 
all distinct i and j, 0j ^ Oi + A n , i.e. (0i,...,0 n ) is in $7 U(n ). Using 
the Hadamard-Fisher inequality as in the proof of the previous lemma, 
pi (0%, . . . , k ) is therefore bounded by 

/ dxi... j dx k p^ n \0i,xi,...,0 k ,x k ) ) y2 — t ( I Pi^Hv) 

Jfa+An Je k +A n J m > Q m - \Ja n 

with a n = (0\ + A n ) U • • • U (Ok + A n ). Once again, the Hadamard-Fisher 
inequality gives 



P2 k (fllj^i) • • • A,^fc) < JJp2 (n) (6>i,Xi). 

i=l 

This gives the upper bound, uniform in (0\, . . . , n ) G f2 u ( n ), 



converging to 



u 2 du 



48tt 2 

as previously seen. □ 

Remark. A better upper bound in the previous proof can be obtained as 
follows. By a direct ensembles argument, p k (Oi, ■ ■ ■ , k ) is bounded by 

dxi... / dx k pzl (0i, x%,..., k , x k ) 
i+a„ Je k +A n 



This also comes from the fact that the inclusion-exclusion series (2.7) is 
alternate. We know by Fisher-Hadamard that this upper bound is lower 
than (J. p2^ n \o,x)dx) k , which is interpreted as follows: p k converges 
to its limit from below, which is a sign of repulsion before the asymptotic 
independence. 

Lemma 2.7 (Negligible set). Let f2 U ^ n be the complement of J2 u ( n ) in I k 
(see pT^j. Then 

f p k ](n) (0 l ,...,0 k )d0 l ...d0 k — ► 0. 

Proof. Let (0\,..., k ) G £^^ n \ and note = {0±, . . . , k } the set of these 
points. 

For notational convenience, one can 

suppose 0\ < ■ ■ ■ < k . A set of a 
points S < ■ • ■ < 0t is said to be ^ ^ »■ q 

a cluster of if, for all s < k < t, « - N 
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9k+i G 6k + A n . Let Ext(0) be the set of points which cannot be included 
in a maximal cluster: 

Ext(0) = {9i : 1 < i < k, (9i + A n ) n 6 = 0}, 

This way we get a partition 

6 = Ext (6) uf =1 Ch 

where there are t maximal clusters Cli, . . . , Cb\ Suppose that Ext(B) = 
{9, L1 < ■ ■ < 6i p } where p = | Ext(0)|. Then the following obvious bound 
holds: 



P^ n \0i, ■ ■ ■ ,9k) < / dsi... / dx p p^J(9,...,9 k ,x 1 ,...,x p ) 
Je H +A n Je tp +A„ 

j=i j = iJe i:j +A n 

where we used the Hadamard-Fisher inequality. This last product of p ele- 
ments is bounded, uniformly in (9±, . . . , 9k), as it is equal to the pth power 
of 

/ p? n \o,x)dx, 

which converges. Concerning the first product, we analyze each term in 
the following way. For any 1 < t < k — 1, the correlation is a t x t 

determinant of jf u ( n ). Suppose that all arguments of p^™^ are included in 
an interval of size c n . By subtracting the first row from the t — 1 others 
one obtains that the first row is 0(n) and the others are 0(n 2 c n ), because 
|-f^' U ^|oo = 0(n 2 ). Hence /j[ I< ™' ) = 0(n 2t_1 c^ _1 ). As a consequence, as all 
points of a cluster CL, are within distance 0(n~ 4 / 3 ), 

p^ | (Cl l ) = 0(nll^N). 

This leads to the upper bound 

But the size of the set of points {9\, ...,#&) G with such a clusters 

configuration is 0{n~^ k ~ p ^), because there needs to be k — p small gaps 
(i.e. of size 0(n -4 / 3 )) between successive 0fc's. Hence the total contribution 

of such clusters to the integral over f2 U ^ is 



(nI-K fe -^) 



which goes top because t < k — p (with equality if all clusters have only 
one point) and k — p < 2{k — p) {k = p is not possible: this would mean that 

(0i,...A)*n u(n) ). □ 
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As previously explained, the three above lemmas conclude the proof of 
Theorem 11.11 



Proof of Corollary L2_. Note that the events {xe < n 4 / 3 < ye, 1 < £ < 
k} and 

{ X (n) ((72vr) 1 /3 (a;fc;yfc ) )( o,2^)>l, 

X (n) ((72vr) 1 / 3 (x £ , ye), (0, 2vr)) = 1, 1 < i < k - 1, 

x W((72 7 r) 1 / 3 (^_i,^) J (0, 2tt)) = 0, 1 < I < k} 

are almost surely the same (yo = 0). The independence property of the limit 
Poisson point process % m disjoint subsets therefore yields 

x e <n 4 / 3 7> (n) <y E ,l<e< fc) 

1 _ e -&2-*2)) \[(yl - xl)e~^-^l[e-^-y^\ 



n— >oo 



=1 1=1 



where we noted that, for any interval (a, b), x^((727r) 1//3 (a, b), (0, 2ir)) is 
a Poisson random variable with parameter (6 3 — a 3 ). A straightforward 
simplification of the above products gives the expected result. Concerning 
the limiting density of n 4 / 3 , we proceed in two steps. First, from formula 



(1.1), the joint density of n 4 / 3 (t^ , . . . , ) is, in the limit and on the 



simplex < u\ < • • • < u k , proportional to 



2 2 2 -ui 

u 1 u 2 ---u k e k . 



Consequently, 



X 



[T k < x) = c k u k e u h I Ul u 2 . . . u^dui . . . du fc _i 

Jo Jo<u±<-~<Uk-i<Uk 
f X 2+3(fc-l) -ui f 2 2 2 i i 



k 

JO J0<v 1 <-~<v k _ 1 <l 

so the density of T k is proportional to x 3k ~ 1 e~ x ' 3 . □ 

2.4. The Gaussian unitary ensemble. The small gaps asymptotics for 
the GUE are obtained exactly in the same way as for the unitary group. 
The only difference is that the determinantal kernel is not translation in- 
variant anymore, leading to some complications. More precisely, let (h n ) 
be the Hermite polynomials, more precisely the successive monic orthogonal 
polynomials with respect to the Gaussian weight e~ x ' 2 dx. Following [I], 
where the following results on the determinantal aspect of the GUE can be 
found, we introduce the functions 

e -x2/4 

4>k{x) = i , h k (x). 

VV^rkl 
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in 



Then the set of point {Ai, . . . , A n } with law (1.2) is a determinantal point 
process with kernel (with respect to the Lebesgue measure on K) given by 



K GVE ^(x,y) 



n- 



x-y 



3/2 U n -i(yV^) J q tf n (txVn + (1 - t)y^fn)dt 

-ipnivVn) J ip' n _ 1 {txy/n + (1 - t)yy/n)dtj 



(2. 



We will now discuss why Lemma |2.4| till Lemma |2.7| still hold in the case of 
the GUE ensemble, restricted to (—2 + eq,2 — £q) for a given eo > 0- Note 
that the Hadamard-Fischer inequality, an important tool for the proof of all 
the following lemmas, still holds for the Gaussian unitary ensemble because 
the set of its eigenvalues is a determinantal process. 



We now note 



-so 



1=1 



and its thinning £( n ) obtained from £' n ) by only keeping the eigen- 
values Afc for which ^^(Afc + ra -4 / 3 ^ = 1). The absence of successive 
small gaps, Lemma 2.4, only requires in its proof that |Jf GUE («) 
00), I^^U^loo = 0(ra 2 ). This is proved in LemmalT 



The analogue of the simple convergence of the correlation function 
associated to the thinned point process, Lemma 2.5, is now, for any 
distinct points Ai, . . . , A& in (—2 + eq,2 — eq), 



~GUE(n), x N v 

P k (Ai, • • • , A fe ) 



1 



48vr 2 



u 2 du 



k k 



IK" 

i=i 



The proof, in the same way as the unitary case, requires uniform 



bounds on the partial derivatives of the kernel, given in Lemma 2.8 
and the asymptotics 



p^ VE{n) (Xi,x)dx —> (— . 
\ l+ A n n ^°° V 487r J A 



u 2 du) (4 -A 2 ) 2 , 



which is a direct consequence of Lemma |2.9 



The uniform boundness result, Lemma 2.6 only requires the Hadamard- 
Fischer inequality, Lemma 2.3, which applies to any determinantal 
point process. 

Finally, we give the analogue of Lemma 2.7 in the following way. Let 
^GUE(n) = {(Ai; ... iAfe )el*:A^ Xj +A n ,l< i,j < k}, 
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and ^ GUE ( n ) kg its complement in I k , where I is any Borel set in 
(-2 + e ,2-£ ). 

^ UE(n) (Ai,...,A fc )dAi...dA fc — > 0.. 



n GUB(n) ' « n^oo 



The only estimate necessary to this result is that the first order 
partial derivative of the kernel is uniformly O(n), which is one of the 



estimates of the following Lemma 2.8 



Lemma 2.8. Leteo > 0. Uniformly for x , y £ (— 2+eq, 2— eq), K GlJE ( n \x , y) 
is 0(n), the first order partial derivatives of /f GU E(n) are Q( n 2^ an( ^ ^ e sec _ 
ond order ones are 0(n 3 ). 

Moreover, under the additional condition \x — y\ > 5 > 0, K G ^ E ( n \x,y) 
is uniformly bounded, by a constant depending only on Eq and 5. 

Proof. Prom the Plancherel-Rotach asymptotics for the Hermite polynomi- 
als (Theorem 8.22.9 in [33]), for any nonnegative integer k, ip n ~k(Vn x ) is 
0(1/ y/n), uniformly in x 6 (—2 + eq,2 — eq). Consequently, if \x — y\ > 6, 
from the first line of (Q, K GUE (") is uniformly 0(1). 

To prove the first assertion of the lemma, we use the stability property of 
the functions (ipk) by derivation (see [1] Lemma 3.2.7): 

x 

4>' n (x) = --lpn(x) + y/ntpn-^x). (2.9) 



Injecting this expression of ip' n in the last two lines of (2.8) and using ip n = 
0{l/y/n) yields K GVE( - n \x, y) = O(n) uniformly for x,y G (-2 + e , 2 -e ). 
Iterating this procedure, any partial derivative of if GUE ( n ) Q f order k are 
n k+2 ^j mes a Ji near combination of factors of type ip n —i Jq if>n—j} which shows 
that any partial derivative of order k is 0(n fc+1 ). □ 

The Plancherel-Rotach asymptotics for Hermite polynomials also yield a 
precise evaluation of the correlation functions evaluated at close points, in 
particular the following result. 

Lemma 2.9. Let Eq > 0, c > 0. Then as n — > oo, uniformly in x E 
(-2 + eo,2 -E ), \u\ < cn~ 4 / 3 , 

p^ E{n) ( x ,x + u) = ^ n 4 (4 - x 2 ) V + O (n) . 

Proof. The intuition for this result is as follows. From the well-known con- 
vergence to the sine kernel, 

1 -K G ^(x,x + V ^ ■ Sin( ™ } 



np sc (x) \ ' np sc (x) J n->oo TTV 

If this convergence is sufficiently uniform, one expects for small u a Taylor 
expansion 

K G ^ (x, x + u) « Sin(7r ^ (x)u) * n Psc (x) - W. 

TTU D 
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Injecting this expansion in 

pGUEW x+u) = ^ G UE(n) ^ x) ^GUE(n) (x + ^ x + u )_i^UE(n) ^ ^ + u) 2 

gives the expected result. To justify the accuracy of the sine kernel ap- 
proximation for u close to 0, we rely on Corollary 1 in (more general 
asymptotics in the full complex plane for, in particular, Hermite polynomials 
were obtained in [H]): using the Plancherel-Rotach asymptotics of Hermite 
polynomials the authors show thatj^] 

K GUE(n) ( j = 1 sin (n(a e - a )) Q 

2Wsina;sin0 sin (w - 8) 

uniformly for x, y in (—2 + £0,2 — eq) where x = 2 cos a; , y = 2cos#, < 
8, uj < 7r, a w = sin(2a;) — 2a;, a# = sin(20) — 26*. As ag — a^ = 2{u — 8) sin 2 # + 
0((w — #) 2 ) and a; — = (y — x)/ v 4 — x 2 + 0((x — y) 2 ), a simple expansion 
yields 

^K GUE W(*,y) 

/9 sc (x) 

1 sin(2ra(u;-#)sin 2 # + 0(n(w-#) 2 )) 



27r 2 p sc (x) v /p S c(x)p sc (2/) sin ( + 0((x _ )2) 

_ sin (nir(x - y)p sc (x) + 0(n(x - y) 2 )) 
tt(x - y)psc(x) + 0((aj - y) 2 ) 

For x — y = 0(n~ 4 / 3 ), this implies 

' -K GUE W ( x , y)=n- ln 3 ir 2 (x - y) 2 p sc (x) 3 + 0(1) 



+ 0(1) 



+ 0(1) (2.10) 



Psc(x) '"" 6 

and the expected result for p GUE ^(x, x + u). □ 



The above discussion concludes the proof of Theorem 1.4 Its Corollary 



1.5 follows exactly in the same way as Corollary 1.2 from Theorem |1.1| 



Corollary |1.6| is a straightforward consequence of the scission of the limit- 
ing Levy measure: if x = {( a ji bj)} is a Poisson point process with measure 
Ex(^4 x B) = fi\(A) [12(B), conditionally to the b^s the a^s are distributed 
independently of each other, independently of the b^'s, as a Poisson point 
process with intensity proportional to p\. In particular, in our situation, 
the abscissa associated to the minimal ordinate is distributed with density 
proportional to 

(4-x 2 ) 2 . 



^note that the normalization in [TT] is different, as their semicircle law is supported on 
(— s/n, \/n). 
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3. Large gaps 

3.1. The unitary group: asymptotics of Toeplitz determinants. To 

evaluate the extreme gaps, we first investigate the queuing distribution of 
one given spacing. A large part of the literature concerns the probability of 
having no eigenvalues in a given interval (e.g. |13|[36j). A rigorous derivation 
of the queuing distribution for the large gaps was only given recently thanks 
to the steepest descent method for Riemann Hilbert problems [HI [23] or by 
operator theory tools for Toeplitz determinants |14j . 

The link with the nearest neighbor distribution is given by the following 
lemma, explained in [26], appendix A. 8. 

Lemma 3.1. Let u ~ l^u( n ), with eigenangles < 9\ < • ■ ■ < 9 n < 2tt, and 
£W=E?=i<^- Then 

nF(9 2 -e 1 >x) = -Ap^W( , x ) = o) . 

Moreover, the probability of having no eigenvalues in an arc of size 2a is 
equal to the Toeplitz determinant (this is a consequence of Heine's formula) 

(i r-2-iT-a \ 
— / jWdO . 

All the asymptotics we need in the following are direct consequences of the 
precise analysis of D n (a) . Such estimates were first given by Widom [36] , for 
the following equation (3.1) without the error term, (3.2) first by Krasovsky 
[24] and (3.1) together with (3.2) by Deift & al [9]: they prove that for some 
sufficiently large so and any e > 0, uniformly in so/n < a < ir — e, 



logL> n (a) = ra log cos- --login sin-) +c + O — . (3 1) 

2 4V 2/ \nsm(a/2)J 

d , „ , x n 2 a I a _ / 1 \ .„ „. 

— \ogD n {a) = - tan- - -cot- + . (3.2) 
da 2 2 8 2 \rasm [a/2) J 

for an explicit constant cq, which remained conjectural for 20 years. From 
this, we can give upper bounds on nP (O2 — 0\ > u), the expectation of the 
number of gaps greater than u. 



Lemma 3.2. Given any a > ande > 0, uniformly inu £ (a^logn/n, 2n- 
e), as n — ^ 00 

nF(8 2 -9 1 >u) <un 2+0 ^e- n2! £. 
Proof. From Lemma |3.1[ 



nP(0 2 - Ox > u) = -~D n (a) = -\ f ^logZ) n (a) j Djn i 



evaluated for u = 2a The first term is evaluated thanks to (3.2): it is 
Q(u n 2+0 ^) uniformly in (ay/log n/n, ir — e). Concerning the second term, 
(3.1) and the inequality logcosx < —x 2 /2 on [0, 7r/2) imply that it is 
g-n a /8 n 0(l)^ conc l uc ling the proof. □ 
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Note that, substituting u = 2a 



in the asymptotics (3.1) and 



(3.2) yields 



nP (6 2 - 0! > u) = n l -wi +0{ - l \ (3.3) 
where the 0(1) terms are uniform in A € [a, b], for any given positive a and 



b. Hence, if A > 32, the expected number of gaps greater than 



VA log n 



goes 



to 0. If, A < 32, this goes to oo. The transition in the number of large 
gaps at A = 32 is a strong clue for Theorem L3 but it is not sufficient: 
concluding the proof requires additional knowledge about the correlations 
between gaps, Lemma [378] , assumed for the moment. Take e > and p > 0. 
We note 



X„ 



n 



-y/321ogn 



r: 



where L = n o(1) . Then 



E(|X n - l| p ) <e p + F(X n <l-e, 

+E(\x n -in Xn>1 ^ <n/3 ) + 



2im 



7< n) >n/2), (3.4) 



v / 3Tlogn / 

and we aim to show that the last three terms converge to (the value tt/2 
is arbitrary, any angle strictly smaller than tt would be appropriate for this 
proof). First, decompose the unit circle into 8 fixed a ngle s of size ir/4. If 
> vr/2, on of these arcs is free of eigenvalues. From (3.1 ), the probability 
of this event decreases exponentially, so the last term in (3.4) converges to 
0. 

Moreover, integrating by parts, 



E ( \X„ — l\ p t 



X n >l+e,T 1 (n) <7r/2 



/ p{u-l) p - l W{X n > u,Tl n) < i:/2)du+pe p ~ l ¥{X n > l+e,T^ n) < tt/2) 

Jl+e 

l)P-ip(X n > ujdu + pe^FiXn > 1 + e) 



< 



V32 1ogn 2 



p(u 



l+e 



because X n needs to be shorter than 



<y( n ) _ rp^g probability that X n 



\J32\ogn 

is greater that u is obviously shorter than nP (62 — 6\ > u\/32 log n/n), the 
expectation of the number of gaps greater than ^^32 log n/n. Hence the 



above quantity goes to thanks to the uniform estimate of Lemma 3.2 

Finally, showing that P(AT n < 1— e) — > requires an additional argument, 
the negative correlation property for empty sets events, Lemma 3.8 In 
particular, this negative correlation implies the following result. 

Lemma 3.3. Consider a set of disjoint arcs on the unit circle. Let M n be 
the number of such intervals free of eigenangles, i.e. those Ik's such that 
£( n )(I fc ) = 0. Then Var(M„) < E(M n ). 
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Proof. This is a straightforward consequence of Lemma 3.8 as is a 
determinantal point process, for disjoints / and J, F(^ n >(I U J) = 0) < 
F(^ n \lj) = 0)F(C (n) (I k ) = 0). Hence, noting m n the number of initial 
arcs, 

e(m„ 2 ) = Yl n&\i j ^h) = o) 

l<j,k<m n 

< 2 Yl n& ) (i j ) = o)m (n Hh) = o)+ n& ] (ij) = 

l<j<k<nin l^j^mn 

< E(M n ) 2 + E(M„), 

as expected. □ 

Consider now a number m n of disjoint intervals I±, . . . , I mn of length (1 — 
£ ^ V32^iogn in ^ We can find mn = \2im/y/32 log nj of them. Let M n 
be the number of such intervals free of eigenangles, i.e. those Ik's such that 
^ n \lk) = 0. If there are less than £ n gaps larger than (1 — e) vl^jsg j^ either 
there are less than £ n intervals Ifc's free of eigenangles, or there is a gap 
between successive eigenangles containing two intervals: 

F(X n <1-E)< F(M n < in) + P f 7^° > 2(1 - £ ) ^ ^ " 

\ n 

This last term is bounded by the expectation of the number of gaps greater 
than 2(1 - g ) V32togn , from ^ thig goeg tQ Q if ^ _ £ ) > j ( true for £ 

sufficiently small). 



Concerning P(M n < ^ n ), first note that by the estimate ( |3.3[ ), E(M n ) = 
n i-(i-e) 2 + (i) ! go ^ = (E(M„)). This allows to use Chebyshev's inequality, 
for sufficiently large n: 

P(M n < i n ) < F(\M n -E(M n )\ > E(M n )-£ n ) < ( J^ Mn ] < E( U " ' 



(E(M n ) - 4) 2 " (E(M n 



where we used Lemma 3.3 in the last inequality. This last term is equivalent 
to l/E(M n ), thus going to 0, which concludes the proof. 



3.2. The GUE: comparison of Fredholm determinants. For the proof 
of Theorem 1.7 precise asymptotics like (3.1) and (3.2) related to unitary 
groups are not available in the GUE context. This difficulty can be overcome, 
our main observation being Lemma |3.5| the probability that an interval 
is free of eigenvalues is equivalent in the GUE(n) and U(n) cases, up to 
a normalization, if the interval size is shorter than the expected extreme 
gap size. The proof relies on a comparison of the Fredholm determinants 
associated to iT GUE (™) and K v( - n l 
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The rest of the proof is similar to the one concerning U(n). Indeed, 

~ (n) 

consider an e > 0, p > 0, 7? the £ n th largest gap in /, and 



Xn 



O) 



\/321ogn 



' n inf / \/4 — 

We then decompose 

E(|X n -l| p ) <£*> + P(X n <l- £ )+E(|X n -ipi in>1+£ 
This last expectation is, by integration by parts, 

/oo 
pv^ 1 F(X n > 1 + v)dv + peP' 1 F(X n > 1 + e). 

The probability F(X n > 1+v) is lo wer t han the expectation of the number of 
gaps greater than (l+v)t n . Lemma 



3.6 



therefore yields E ( \ X n — l| p l 



L X n >l+e 

as n — > oo. 

Concerning ¥(X n < 1 — e), we proceed as for the unitary group: for 

1 = (a,b) with a < 6, consider fh n = [(b — a)/((l — e)t n )\ disjoint intervals 
of length (1 — e)t n included in /. Let M n be the number of these intervals 
free of eigenvalues. Then 

F(X n <l-e)< F(M n < l n ) + P(T 1 (n) > 2(1 - e)t n ) 



From Lemma 3.6 this last probability goes to as n — > oo if 2(1 — e) > 1, 
£ < 1/2. Moreover, from Lemma |3.7[ 

E(M n ) > n 5 (3.5) 

for some 5 > depending only on e and I. Moreover, from the negative 
correlation property Lemma |3.8| and the same reasoning as Lemma |3.3[ 



Var(M n ) < E(M„). 



(3.6) 



As £ n = n W, from (3.5) E(M n ) — £ n > for sufficiently large n, which 
allows to use Chebyshev's inequality: 



P(M„ < in) < P(|M n -E(M„)| > E(M, 



n) *-n. 



< 



Var(M n ) 



< 



E(Af n ) 



(E(M n ) - £ n y (E(M n ) - I 



the last inequality being (3.6). From (3.5) this last term is equivalent to 
l/E(M n ) and going to 0, as n — > oo, concluding the proof. 

Lemma 3.4. Let 5 n = 0(1). The following asymptotics hold for the unitary 
group and GUE kernels. 

(1) Uniformly for x, y in (0, 2tt) and \x — y\ = 0(5 n ), 
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(2) Leteo>0. Uniformly for x,y in (— 2+eo, 2— eo) and\x—y\ = 0(S n ), 

-J—K^in) ( y) _ ^(nnp (x)(x-y)) = Q / 1 
np sc {x) mrp sc {x)(x - y) \n 

Proof. For the unitary group, the kernel is explicit so 



sin n 




As \x—y\ = 0(5 n ) —7- 0, by expansion of sin at third order (^p) / sin (^p) — 
1 = 0((x — y) 2 ), which concludes the proof. 

Concerning the Gaussian unitary ensemble, the same type of asymptotics 
hold, being a direct consequence of formula (2.10). Note that when taking 
<5n = 0(l/n) the speed of convergence to the sine kernel is 1/n 2 for the 
unitary group, much better that 1/n for the GUE, whose correlation kernel 
is not translation invariant. □ 

Lemma 3.5. Let 5 n = 0(\/log n/n), eo > 0. Then uniformly for x in 
(-2 + e ,2-e ), 



p GUE(n) / x% g 



x, x H 7 



1 < i < n 

Psc(X) 

- P u ( n ) fa [0, 27r<y , 1 < i < n) | < n W _1 . 

Proof. By inclusion-exclusion, the probability that a determinantal point 
process with kernel X has no points in a measurable subset A is the Fredholm 
determinant (see e.g. Lemma 3.2.4 in pQ) 

00 (_ \\k r 

det(Id - -RTa) = 1 + > / det(K( Xil x,))dx 1 . . . dx k . (3.7) 

^ — ' k\ Ak kxk ' 

k=l JA 

To compare empty sets probabilities, we therefore need to compare Fredholm 
determinants. A classical inequality (see e.g. [T7| Chap IV, (5.14)) is 

| det(Id + A) - det(Id + B)\ <\A- £|ie 1+|A|l+|B| \ (3.8) 

where \T\\ is the trace norm of a nuclear operator T. However, for posi- 
tive (like A or B) operators T(f){x) = f K(x,y)f(y)dy the trace norm is 
J \K(x, x)\dx, but for non-positive operators, like A — B, the trace norm is 
difficult to evaluate, even for a compactly supported continuous kernel K. 
However, in such a case, the Hilbert-Schmidt norm is computable: 

\T\\ = J! \K(x,y)\ 2 dxdy. (3.9) 

This is the reason why, instead of the Fredhom determinant and the inequal- 
ity (3.8), we will use the modified Carleman- Fredholm determinant 

det 2 (Id + T) = det(Id + T)e" TrT 
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and the inequality 

|det 2 (H + A)- det 2 (Id + B)\ < \A - 5| 2 e^ (|A|2+|B|2+1)2 , (3.10) 

which can be found in |17| . Chap IV, (7.11). For our purpose, note that 
from (3.7), after a simple change of variables, the probability that there are 
no eigenvalues in [x,x + 5 n /(p sc (x))] is equal to 



y, 



-,x + 



in 



np sc (x) ' np sc (x) 



dyi 



1+Y / det ( -LtfGUEW / x + 

fr[ k\ J( nSn )k kxk \np sc v 

In the same way, the probability that there are no eigenvalues in [0, 2Tr5 n ] is 



k=l 



k\ 



detl^ u («)(^,% ) )d yi ...dy k . 



Hence the inequality (3.10) will be applied with A and B integral operators 
with respective kernel 



A(u,v) 



1 



-K 



GUE(n) 



and 



B(u,v) 



np sc (°- ri<5 ™) 
2tt 



x + 



ii 



x + 



np sc (x) np sc (x) 



n (0,ni5 n ) 



2vr 2tt 

— u, — V 
n n 



From Lemma 3.4 the infinite norm between the two kernels above is 0(n<5^) = 
0(logn/n), so by (3.9), integrating on a domain of area (n<5 n ) 2 = O(logn), 

\A - B\ 2 = (0((logn) 2 /n 2 ) O(logra)) 1 / 2 = 0((logn) 3 / 2 /n). 
Moreover, consider a parameter a n > and decompose 

\ A \l = J J \ A ( x ^y)\ 2l \x- y \<a n dxdy + J J \A(x,y)\H\ x _ y \ >an dxdy. 

From Lemma 3.4, if \x — y\ > a n then |^4(x,y)| is smaller than l/(7ra n ) + 
0(logn/n), and when \x — y\ < a n it is bounded by 1 + 0(logn/n). Conse- 
quently, 

\A\ 2 2 = 0(a„ v / logn) + 0(logn/a 2 ) = 0((logn) 2 / 3 ) 
by choosing a n = (logn) 1//6 . In the same way, \B\l = 0((logn) 2 / 3 ). Hence 



|det 2 (Id + A)- det 2 (Id + B)\ = O 



(logn) 3 / 2 



M(lognf/3) < n 0(l)-l 



n 



Finally, using once again Lemma 3.4 here on the diagonal, 



TiA 



A(x,x)dx = -n5 n + 0((log n) 3/2 /n), 



(3.11) 



(3.12) 
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and the same for TiB. Finally, write 

|det(Id + A)-det(Id + B)| 

< e Tr A |det 2 (Id + A) - det 2 (Id + B)\ + \e T * A ~^ B 



l| |det(Id + B) 



Formulas (3.11) and (3.12) show that the first term is bounded by n ^ 1 . 



Moreover, det(Id + B) is a proba bility, so bounded by 1, and the estimates 
of Tr^ and TiB, formula fl3.12| ), show that e TlA - TlB - 1 is O (log n/n), 
concluding the proof. □ 



Lemma 3.6. Let s(L) = inf j y/4 — x 2 , e > and a n = ^32 log n/n. There 
are some constants ci,C2 > depending only on e and I such that for any 
v > e and n > 1 

E \{i : Xi G /, A i+ i - Xi > (1 + v)a n /s{I)}\ < Cl rT C2V . 

Proof. The first step allows reasoning on fixed intervals instead of gaps. 
More precisely, note / = [a, b] with a < b, and consider the intervals of 
length (1 + Tj)a n /s(I) by successive slips of size s n = ^a n /s(I): 



a + ks n , a + ks n + 1 + 



a, 



0<k< Pn = [(b-a)/s n \. 



2 J 8(1) _ 

There an injective map associating to any eigenvalues gap of size at least 
(1 + v)a n / s(I) an interval Jk included in this gap, for example the one with 
lower index. Consequently, 

Pn 

E|{t: Xi € I,X i+1 -Xi > (l + v)a n /s(I)}\ < J^P(J fc = 0). 

where we use the abbreviation f (Jk = 0) = F (Xi Jfc,l < i < n). The 
second step consists in obtaining uniform upper bounds for these empty 
intervals probabilities. For this purpose, the negative correlation property, 
Lemma 3.8, is used by partitioning the interval J/%: 



where 



has length (1 — e')a n /s(I), with e' to be chosen positive but 
sufficiently small compared to e; 

r(2) 



• Jl has length (| + e')a n / 's(I); 

(i) 

• the intervals all have length 
alpha n / (2s(I)) and their number is q v 

• Mk is a residual interval. 

The negative correlation property yields 



[v 



pGUE(n) (Jfc = Q) < p GUE(n) ^(1) = Q \ p GUE(n) ^(2) = Q \ |j p GUE(n) (t 



Li) 



3=1 
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This now can be upper-bounded using Lemma 3.5 because all intervals are 
shorter than a n /s{I): for example, noting x one extremity of 

p GUE(n) ^(D = ^ = p U(n) ^ (1 _ e^^/JT^/^J)] = ) + n^~ l 

< pU(n) Q Q) (1 _ £ , )an] = Q ) + n 0(l)-l = n _(l- £ ') 2 +0(l) 

with 0(1) not depending on the index k, and where the last estimate relies 
on ( |3.1[ ). In the same way, 

p GUE(n) ^(2) = Q \ < „-(|+e') 2 4-0(l) 5 p GUE(n) (^ij) = ^ < n -±+0(l). 

Gathering all these results, 

E|{* : Aj G I,X i+1 - Xi > (l + v)a n /s(I)}\ < r^MMf +e'f-{\+0W)^ . 

We can chose e' sufficiently small such that 1 — (1 — e') 2 — (| + e') 2 < 
(e.g. e' = e 2 /8). For such a choice, for sufficiently large n not depending on 
v the above exponent is smaller than — c — | [v — e\ for some c > 0. Such a 
function is smaller than — civ on v > e, for some C2 > 0. □ 

Lemma 3.7. Let I = [a,b] with a < b, s(I) = inf/ \/4 — a; 2 , e £ (0, 1) and 
a n = ^/32 log ri/ra. Consider a maximal number 

m n = L(6-o)/((l-e)a n /a(/))J 

0/ disjoint intervals 1%, . . . , I mn of length (1 — e)a n /s(I) included in I. Let 

M n = \{1 < j < m n : Xi £ Ij, 1 < i < n}\ 

be the number of those intervals containing no eigenvalues. Then 

E(M n ) > n 5 

for some 5 > depending only on £ and L 

Proof. We first take a restricted interval to avoid the fluctuations in the 
spectral measure: there is /' = (a', b') C / with a' < b' such that 

■UP,V^? , (3.13) 
s(i) 2 

Then, for a given interval L^ = [x,x + (1 — e)a n / s(L)] included in from 
Lemma 13.51 



p GUE(n) (A . $l kf i<i< n ) = P u(n) (0i [0, (1 - e)V±-x 2 /s(L)}, l<i< nj+n ^ 1 

with 0(1) uniform in x. From (3.13) this is greater than 

P u(n) ^. ^ [ , (l - I)] , 1 < i < n) + n ^)- 1 = n-C 1 "!) 2 ^ 1 ). 

from (3.1). There are n 1+0 ^ intervals I^s included in so as n — > 00 

E(M n ) > n 1 -( 1 -f) a+0 ( 1 ) >n 5 
with 5 = (1 - (1 - e/2) 2 )/2 for example. □ 
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3.3. The negative association property. As previously noted, to deduce 
the asymptotics of the largest gaps, the correlation between distinct gaps is 
required. 

In the context of a point process on a finite set 5, let A and A' be distinct 
disjoint subsets of £ , and write A for the event that the elements of A are 
free of particles. Shirai and Takahashi showed that for determinantal 
point processes, the empty sets events are negatively correlated: 

P(0 AUA ') < P(0 A )P(0 A '). (3.14) 

This negative association property has received a considerable attention in 
the past few years, in the context of ASEP for example. Still for discrete de- 



terminantal point processes, formula (3.14) was generalized to all increasing 



events [25], and general criteria for the negative association property were 
given in [JJ. 



The following continuous analogue of (3.14) holds. It can be shown by a 
simple discretization, relying on results from [U 1251 129] . We give another 
justification, which relies on a work of Georgii and Yoo |16j . 

Lemma 3.8 (Negative correlation of the vacuum events). Let be the 

point process associated to the eigenvalues of Haar distributed unitary matrix 
(resp. an element of the GUEJ. Let I\ and L^ be compact disjoint subsets 
o/[0,2vr) (resp. R). Then 

P (e (n) (/i U I 2 ) = o) < P (V n) (/i) = o) P (f (n) (J 2 ) = 0) . (3.15) 

Proof. The general negative correlation result, Corollary 3.3 in [16J, requires 
a locally trace class operator K with a restriction on its spectrum. More 
precisely, consider the unitary case, the GUE proof, being similar. For an 
operator K acting on L 2 ((0, 2ir), //), if spec(AT) C [0, 1], there exist a unique 
determinantal point process £ with kernel K (see |31|): under the additional 
hypothesis spec(A') C [0, 1), Georgii and Yoo proved that for any compact 
and disjoint Borel sets A C A C [0, 2ir), 

P M (£(A) = | f (A/A) = 0) < P„ (£(A) = 0) . 
In the case of Haar distributed unitary matrices, K = K v ( n ' is a nuclear op- 
erator with kernel K^^ n \x,y) = 2~ S ™n™(x-y)/2) > an< ^ defines a projection: 1 
is in its spectrum, and the general statement does not directly apply. To care 
of this minor problem, look at the restriction K^ n ' of AT u ( n ) to a compact 

subset A of (0,2vr) (A^ (n) = P A K V ^P A , P A being the projection on A). 
Suppose that the set (0, 2tv)/A has a nonempty interior. As a projection of 

non-negative and trace class. As for any determinantal 

point process, 

E (z« (n) ( A )) = det(Id + {z- 1)< W ) (3.16) 

in the sense of Fredholm determinants of a trace class operator. Suppose 
that K^ n ^ has an eigenvalue A > 1. Then by choosing z = 1 — 1/A > 0, 
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( [316} yields 

E (z^W) = if A > 1, P (f (n) (J) = 6) = if A = 1. 



In each case, this is absurd because the joint law of the eigenvalues is ab- 
solutely continuous with respect to the Lebesgue measure on (0, 27r) ra , and 
(0, 27r)/A has a nonempty interior: both quantities need to be strictly posi- 
tive. Hence K^^j is a trace class operator with spectrum in [0, 1), and the 
result from [16] applies. □ 



Remark. Another way to prove Lemma 3.8 is as follows: the inequality 
can be stated for the determinantal point process with kernel aK^^ with 
< a < 1, and then the inequality remains true for if u ( n ) by continuity 
of the application K \-t det(Id — K) in the set of trace class operators, by 
(pl). 
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